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Abstract 

If a quantum system of Hilbert space dimension mn is in a random pure 
state, the average entropy of a subsystem of dimension m < n is conjectured 
to be 5 m , n = YJk=n+\ | ~ an d is shown to be ~ In m - || for 1 <C m < n. 



^ ■ Thus there is less than one-half unit of information, on average, in the smaller 



C3 



subsystem of a total system in a random pure state. 
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One natural way to get entropy, even for a system in a pure quantum state, is 
to make the coarse graining of dividing the system into subsystems and ignoring 
their correlations. For example, suppose a system AB with Hilbert space dimension 
mn and normalized density matrix p (a pure state p = \if))(^\ if p 2 = p, which is 
equivalent to Trp 2 = 1) is divided into two subsystems, A and B, of dimension 
m and n respectively. (Without loss of generality, take the first subsystem, A, to 
be the one with the not larger dimension, so m < n.) The density matrix of each 
subsystem is obtained by the partial trace of the full density matrix p over the other 
subsystem, so 

p A = tr B p, (1) 
Pb = tr A p. (2) 

The entropy of each subsystem is 

S A = -trp A \iap A , S B = -trp B \np B . (3) 

Unless the two systems are uncorrelated in the quantum sense (which corresponds 
to the case that p = Pa® Pb), the sum of the entropies of the subsystems, which 
is a coarse graining that ignores the correlations, is greater than the fine-grained 
entropy Sab of the total system: 

Sa + S B > Sab = -trphxp. (4) 

In fact, the three entropies Sa, S b , and Sab obey the triangle inequality so if 
the entire system AB is in a pure state, which has Sab = 0, then S A = S B , which 
is an immediate consequence of the well-known fact that pA and p B then have the 
same set of nonzero eigenvalues. 

It may be of interest to calculate how much entropy one typically gets by this 
coarse graining 0, |3|, |4|, [5|]. For example, what is the average, which I shall call 

S m ,n = (Sa), (5) 
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of the entropy Sa over all pure states p = \i/j){if)\ of the total system? Here the 
average is defined with respect to the unitarily invariant Haar measure on the space 
of unit vectors in the mn-dimensional Hilbert space of the total system, which 
is proportional to the standard geometric hypersurface volume on the unit sphere 
g2mn-i w h}ch those unit vectors give when the mn-complex-dimensional Hilbert 
space is viewed as the 2mn-real-dimensional Euclidean space ||. Note that since 
Sa is a nonlinear function of the density matrix pa, the average {Sa) of this entropy 
function is not the same as this function evaluated for the average density matrix 
{Pa) —1/m (the identity matrix acting on the subsystem Hilbert space, divided by 
its dimension m), which would be an entropy of S max = mm, the maximum entropy 
the subsystem A can have. It is convenient to define the average information of the 
subsystem as the deficit of the average entropy from the maximum, 

Im,n = S max {Sa) = lnm, S mjn . (6) 

Lubkin |2| calculated that, in my notation, 

/+ 2\ m + n 

\ tr PA) = XT' 7 

mn + 1 

and hence he estimated that for m <C n, 

HlP — 1 

S min ~\nm- — . (8) 

2mn + 2 

However, he was unable to calculate S m , n exactly Lloyd and Pagels [|J, apparently 
unaware of Lubkin's work, as I was also when I did my calculations, made progress 
from a slightly different angle by calculating the probability distribution of the 
eigenvalues of pa for random pure states p of the entire system. The result, after 
inserting the differentials that were used in the calculation || but which were not 
given explicitly in the paper H], and after changing variables to the eigenvalues pt, 
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IS 

m m 

P(p 1 ,...,p m )d Pl ...dp m oc5(l-Y / Pi) II (Pi-Pj) 2 T[(Pk~ md Pk)- (9) 

1 = 1 l<i<J <7Tl fc = l 

The normalization constant for this probability distribution is given only implicitly 
by the requirement that the total probability integrate to unity. Although they also 
did not calculate S m , n exactly, Lloyd and Pagels |§ came to the same qualitative 
conclusion as Lubkin ||, that for mCn the typical entropy of the (much) smaller 
subsystem is very nearly maximal. 

Here I shall show that for 1 <C m < n, 

in 

S mjU ~lnm- — , (10) 

which agrees with Eq. (8) above from Lubkin in the region of overlap but does 
not require m <C n. An exact calculation for m = 2 and m = 3 led me to the 
tentative conjecture that the exact general formula for m < n is 

mn 1 m—1 

S m ,n = l ' ( U ) 

k=n+l K LU 

which rather remarkably agrees with what I later calculated exactly for m = 4 and 
m = 5, so I now think it would be surprising if it were not always correct, though 
I have not yet found a proof for this conjecture. Furthermore, for large n, the 
asymptotic expansion for Eq. (11) is 

m 2 -l ^2. m 2 i-l 

Sm,n = lnm - — + 2^ B 2j — , 12 

Ivan ~[ Ijm 3 nr 3 

where B 2 j are the Bernoulli numbers, which fits both Eqs. (8) and (10) when n 3> 1. 
To calculate S m ,m h is convenient to define Qi = rpi and 

m 

Q(q 1 ,...,q m )dq 1 ...dq m = JJ (ft - qj f J{{e- qi q^ m dq^ 

l<i<j'<m i=l 

oc e- r r mn - 1 P(p 1 ,...,p m )dp 1 ...dp m _ 1 dr. (13) 
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Then 



/m 
(YjPi \ n Pi)P(Pl, ■ ■ -,Pm)dpi . . .p m -i 
i=l 

,, . ^ J(E£ig»lngj)Qrfgi ...dq m , , 

= ipimn+l) — — , 14 

mn J Qdq 1 . . . dq m 

using, for integral N, 

fOO N i 

/ e- x x N \nxdx = Mi>(N + l) = r'(N + l) = M(Y,T-C), (15) 

k=i k 

where C is Euler's constant, which, after some algebra, one can see cancels out from 
the final expression for S m , ni leaving a rational number for each pair of integers m 
and n. 

One can readily calculate by hand that for n > 2, 

2n-l i 

S2,n= E ^ (16) 
k=n+l ft 

which, for example, gives = 1/3, slightly less than one-half ^maa, = In 2 in that 
case, and for n > 3, 

3 ™ 1 1 

^3,n= El"". (17) 
fc=n+l K 71 

both of which are fit by Eq. (11), which they suggested as a generalization. For 
m > 3, the expression for Q is too cumbersome for Eq. (14) to be readily evaluated 
by hand, but I was able to calculate it for m = 4 and m = 5 with the aid of 
Mathematica 2.0, after putting in by hand the correct value of the integral of Eq. 
(15), which Mathematica 2.0 evaluates incorrectly. Both of these values of m also 
fit Eq. (11). For m > 5 I ran into another apparent bug in Mathematica 2.0 which 
I have not yet figured out how to circumvent, but the likelihood that the agreement 
of my calculations of S^ n and S^ n with Eq. (11) is due to accident or error seems 
less than the likelihood that my conjectured Eq. (11) is in fact exact for all m. 
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However, because I have not yet found any proof of Eq. (11), it appears worth- 
while to derive an approximate expression for S m ^ n for large m and n, which I now 
do. In this limit, 

S m ,n ^ -^pMVi = -22 — ln — ( 18 ) 

i=l i=l r r 

for p^s which maximize P(pi . . .p m ) or q^s which maximize Q(qi, . . . , q m ). Now 

m 

- \nQ(q 1 , . . . ,q m ) = - ^ ln(% - qj) 2 + ~ ( n ~ m ) ln %] ( 19 ) 

l<i<j'<m i=l 

is the potential energy for m unit charges on the g-line in two dimensions due to their 
mutual electrostatic repulsion, a uniform external unit electric field in the negative 
q direction, and another superposed external electric field of strength (n — m) /q in 
the positive q direction from n — m external charges fixed at the origin. 

For large m and n, we can also make the continuum approximation of ma(x)dx 
"charges" (eigenvalues of the "most typical" density matrix pa) in the range dx of 
the rescaled variable x = q/m, so a is a normalized linear density (with respect to 
x) of eigenvalues. The equilibrium condition (maximization of Q or minimization of 
the electrostatic energy — InQ) gives the integral equation, with w = (n — m)/m, 

r a(x)dx = w-x' 
J x — x' x' 

for x' in the range where a(x') > 0, say < a < x' < b for some constants a and b 
that would depend on w. This is a Fredholm equation of the first kind [0], but with 
the troublesome feature that the right hand side is only given for part of the real 
axis, a < x' < b, where cr(x') > 0. 

The solution can be found as a special case of the Riemann-Hilbert problem: 
"to find a function, harmonic in a certain plane region D, assuming that on some 
parts of its contour we are given the values of the required function, and on others 
the values of its normal derivative" ||. Here we want the charge density, which is 
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(minus) the normal derivative of the electrostatic potential due to cr(x) just above 
the real axis between a and b, given the tangential derivative for a < x < b (which 
must cancel the tangential derivative of the given external electrostatic potential 
for the charges to be in equilibrium) and the fact that the normal derivative is zero 
elsewhere just above the real axis (and is zero at infinity), with D being the upper 
half Euclidean plane. In our special case, the solution is given by the following 
theorem || i~0f : Given the finite Hilbert transformation 



= i /' 

ix J-i y — x 



the inverse Hilbert transformation is given by 



1 /« l-y 2 f(y)dy C 
m = ~* L V —i-y-^x- + (22) 

where the principal parts are taken for both integrals, and where 

C = - f 1 <j>{y)dy (23) 

7T J-l 

is an arbitrary constant. 

Applying this theorem to our problem, singularities can be avoided at a and b if 

a = 2 + w — + w, b = 2 + w + 2 v / TTw. (24) 

Then the charge density (normalized density of eigenvalues of the typical pa) is 



J— x 2 + 2(2 + w)x — w 2 J (x — a)(b — x) 

a(x) = V y > = y y n 1. (25) 

lux Itxx 

This gives, under our large (m, n) approximation, 

S mn ~ Inn / a(x)x In xdx = Inn / dyyJl — y 2 ln(2 + w + 2\/l + wy) 

2n J a n J -i v 

2 r2n 

= In n / d6 sin 9 In VI + 2r cos^ + r 2 , (26) 

7T JO 
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with 



2x — a — b 




(27) 



y = 



b — a 



The argument of the logarithm of the last integral is the distance from a point on 
the unit circle in the (y, z) plane to a point at distance r along the negative real 
axis from the center. Thus the integral can be viewed as yet another electrostatic 
potential in two dimensions, at y = — r from a sin 2 9 charge distribution around the 
unit circle, which is a monopole plus a quadrupole, and this works out to give 



which is Eq. (10). 

Thus we see that when the dimensions m and n of both subsystems A and B are 
large, and when the joint system is in a random pure state, the smaller subsystem 
A (with dimension m) typically has nearly maximal entropy lnm. The average 
deviation or information in the smaller subsystem is 



and is always less than one half of a natural logarithmic unit. That is, for a typical 
pure quantum state of a large system, the smaller subsystem is very nearly maximally 
mixed, showing little signs that the total system is pure. 

Another way of putting it is to say that if the subsystems A and B were broken 
up into tiny sub-subsystems, which typically would each be very nearly maximally 
mixed, there would be virtually no information is the sub-subsystems considered 
separately. For quantum information, the whole system contains more information 
than the sum of the information in the separate parts, and in this case almost all the 
information giving the precise pure state of the entire system, In m + In n units, is 
in the correlations of the sub-subsystems. The result above shows that for a typical 



S mn ~ Inn — 21nr = lnm , 



(28) 




(29) 
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pure state of the entire system, very little of the information, roughly mj (2n) unit, is 
in the correlations within the smaller subsystem A itself, roughly Inn— In m+m/ (2n) 
units is in the correlations within the larger subsystem B itself, and the remaining 
roughly 2 In m — m/n units of information are in the correlations beween the larger 
and smaller subsystems. 
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